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Abstract 

In this work we consider random Boolean networks that provide a 
general model for genetic regulatory networks. We extend the analysis of 
James Lynch who was able to proof Kauffman's conjecture that in the 
ordered phase of random networks, the number of ineffective and freezing 
gates is large, where as in the disordered phase their number is small. 
Lynch proved the conjecture only for networks with connectivity two and 
non-uniform probabilities for the Boolean functions. We show how to 
apply the proof to networks with arbitrary connectivity K and to random 
networks with biased Boolean functions. It turns out that in these cases 
Lynch's parameter A is equivalent to the expectation of average sensitivity 
of the Boolean functions used to construct the network. Hence we can 
apply a known theorem for the expectation of the average sensitivity. In 
order to prove the results for networks with biased functions, we deduct 
the expectation of the average sensitivity when only functions with specific 
connectivity and specific bias are chosen at random. 
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1 Introduction 

In 1969 Stuart Kauffman started to study random Boolean networks as simple 
models of genetic regulatory networks [1]. Random Boolean networks that con- 
sists of a set of Boolean gates that are capable of storing a single Boolean value. 
At discrete time steps these gates store a new value according to an initially 
chosen random Boolean function, which receives its inputs from random chosen 
gates. We will give a more formal definition later. Kauffman made numerical 
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studies of random networks, where the functions are chosen from the set of all 
Boolean functions with K arguments (the so called NK -Networks). He recog- 
nised that if K < 2, the random networks exhibit a remarkable form of ordered 
behaviour: The limit cycles are small, the number of ineffective gates, which are 
gates that can be perturbed without changing the asymptotic behaviour, and 
the number of freezing gates that stop changing their state is large. In contrast 
if K > 3, the networks do not exhibit this kind of ordered behaviour (see [1, 2]). 
The first analytical proof for this phase transition was given by Dcrrida and 
Pomeau (see [3]) by studying the evolution of the Hamming distance of random 
chosen initial states by means of so called annealed approximation. The first 
proof for the number of freezing and ineffective gates was given by James Lynch 
(see [4], although slightly weaker results appeared earlier [5, 6]). Depending on 
a parameter A, that depends on the probabilities of the Boolean functions, he 
showed that if A < 1 almost all gates are ineffective and freezing, otherwise not. 
Although his analysis is very general, until now it was only applied to networks 
with connectivity 2 and non-uniform probabilities for the Boolean function: if 
the probability of choosing a constant function is larger or equal the probability 
of choosing a non-constant non-canalizing function (namely the XOR- or the 
inverted XO R- function) , A is less or equal to one. But it turns out that in some 
cases A is equal to the expectation of the average sensitivity. Therefore we will 
first study the average sensitivity in Section 3. Afterwards it will be shown 
in Section 4 how to use the results from the previous section to apply Lynch's 
analysis to classical NK- Networks and biased random Boolean networks 1 . But 
first we will give some basic definition used throughout the paper in Section 2. 

2 Basic Definitions 

In the following F2 = {0, 1} denotes the Galois field of two elements, where 
addition, denoted by ©, is defined modulo 2. The set of vectors of length K 
over F2 will be denoted by Ff . If x is a vector from F^", its zth component 
will be denoted by Xi. With u« e Ff we will denote the unit vector which has 
all components zero except component i which is one. The Hamming weight of 
x e ¥2 is defined as 

w H {x) = \{i\xi^0,i = l,...,K}\ 

and the Hamming distance of x, y G Ff" as 

dH(x,y) = w H (x0y). 

A Boolean function is a mapping / : Ff — > F 2 . A function / may be represented 
by its truth table t / , that is, a vector in F| , where each component of the truth 
table gives the value of / for one of the 2 K possible arguments. To fix an order 
on the components of the truth table, suppose that its ith component equals 
the value of the corresponding function, given the binary representation (to K 
bits) of i as an argument. 

J a definition will be given later 
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3 Average Sensitivity 



In this section we will focus on the average sensitivity. The average sensitivity 
is a known complexity measure for Boolean functions, see for example [7] 2 . It 
was already used to study Boolean and random Boolean networks for example 
in [8, 9]. 

Definition 1. Let f denote a Boolean function Ff — > F 2 and a unit vector. 

1. The sensitivity S/(w) is defined as: 

*/(w) = |{t | /(w) ^ /(w © u«),i = 1, . . . , K}\ . 

2. The average sensitivity Sf is defined as the average o/s/(w) over all w G 



wEF~ 



Now consider the random variable Fk ■ — ► .Tve, where denotes the set 
a all 2 2 Boolean function with K arguments. The probability measure is given 
by P(Fk = /) = T^pr- The expected value of the average sensitivity of this 
random variable is denoted by Ep K (sf), and is given by 

f 

The expected value was already derived in [10], and is given by: 
Theorem 1 (Bernasconi [10]). 

Let the random variable Fk be defined as above, then 



We will now concentrate on biased Boolean functions. The bias of a Boolean 
function / : ¥^ — > F 2 is defined as the number of 1 in the functions truth table 
divided by 2 K . To define the bias of a random Boolean function two definitions 
are possible. First we can assumes that the truth tables of the Boolean functions 
are produced by independent Bernoulli trials with probability p for a one (This 
should be called mean bias, used for example in [3, 8] ). Therefore consider the 
random variable Fk, p - The probability of choosing a function / is given by 

P{F K , P = f) =p WH(t ^(l -p) 2 *"™'*') 
For p = 1/2 this is equivalent to the definition of Fk- 

2 here it is called critical complexity 
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As a second possibility, we can only choose functions which have bias p 
whereas to all other functions we assign probability (we will call this fixed 
bias). Therefore consider the random variables Ff^° d : ft — > Tk- Denote the 
truth table of a function / by tf. Further denote the set of all Boolean functions 
/ with K arguments and wjj(t /) = p2 K with Tk, p - The probability for a certain 
function chosen according F^ d is given by 

p( Ft Kcd = n = J lf - f e Tk v 

1 K ' P f) [0 KftFK* 

Both definitions ensure that the expectation to get a one is equal to p if the 
input of a function is chosen at random (with respect to uniform distribution). 
But it will turn out that these two different methods of creating biased Boolean 
functions, have a major impact on the average sensitivity. 

The expectation of the average sensitivity of Fk, p was derived in [8] : 

Theorem 2 ([8]). Let the random variable Fk, p be defined as above: 

E FK J Sf )=2Kp(l-p) 

For the random variable F^ d we will now proof the following theorem: 
Theorem 3. Let the random variable F^ d be defined as above: 

2 K+1 Kp(l-p) 



E F fi^(s f ) = 



(2* 1) 



Proof. To find E i? Hxed(s/) we will first consider the random variable Fjt,t '■ ^ ~~ * 
Tic where t e {0, 1, • • • , 2 K } and the probability of a function is given by 

{-pr- if w H (t/) = t 
else 

Consider the Boolean functions as functions into R by identifying 0, 1 € F2 
with 0,1 G R. Then we get or the function /: 

s f = 2~ K ]T |{i|/(w)^/(uWew),i = l,...,A-}| 

K 

= ^ K E E(/( w )-/( w ® uW )) 2 

wGFf »=1 
K 

= 2 ~ K E 2(/(w)+/(weu«)-2/(w)/(weu«)). 

wGFf »=1 
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where again denotes the unit vector with zth component set to 1. Hence by 
the linearity of the expectation 

K 

E FK , t (s f ) = 2- K £(E^J/(w))+E F ^/(w©u«)) 

wGFf »=1 (1) 



- 2E Fw (/(w)/(w©u«))) 



Now we form a matrix with the truth tables of all functions with Hamming 
weight t as column vectors: 



M 



(c«, c< 2 ),---, c^ 2 ")') where c« £ F 



M has exactly ( 2 t ) columns and 2 K rows. Each entry Mjj in the ith row and 
jth column equals the value of function fj given the binary representation of i 
as input. 

Hence Ep Kt (f(w)) is determined by the number of 1 in the row associated 
with w divided by the length of the row. Consider an arbitrary row i. This row 
has a one at position j if the corresponding column has a one at position i. 
But there are ( t S\) column vectors with a 1 at position i. It follows: 

VweFf: E FK , t (/(w)) = %^ = -^. (2) 

it) Z 

As this holds for all w, we have 

Vw,u«GFf: E F ,, ( (/(w$u«)) = i. (3) 

To find an expression for E F fixed(/(w)/(w©uW)) we consider two arbitrary 
rows l,m (I ^ m). Define the following sum: 

(?) 

Obviously M ;>i M m ^ = 1 only if we have a 1 in both rows at position i. This 
means for the column vectors of M, we have cf 1 = = 1. But there are 
exactly ( 2 t _^ 2 ) such column vectors in M. Therefore we have 

f2 K - 2 

V7, m, I 7^ m : 7/. 



t-2 



Asw/wffl uW for all w, uW it follows: 



(1 — 2\ 

E^, t (/(w)/(w©u«)) = ^1 = 2 ^K l } ir ( 4 ) 
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Hence substituting Equations (2), (3) and (4) into Equation (1) leads to 



. K(2 K -t)t 

^F K Jsf) = 2 k-i (2 k —y 

Finally the claimed expression for E F fixod(s/) can be obtained from the above 
equation by a substitution of t: t — ► p2 K . □ 

It should be noted, that the Theorems 1 and 2 can be proved using in a 
similar way. Also worth noting is the fact, that if the functions are chosen 
according Fx, F^ d or Fk, p the expectation of the sensitivity of a fixed vector 
w (namely the expectation of s/(w)) is independent of w (see Equation (1),(2), 
(3) and (4)). Hence the following lemma holds 

Lemma 1. If F = F K ,F^ d or F K , p , then 

Vw,v e Ff : E F (s/(w)) = E F (s/(v)) 



Before proceeding to the next section, it should be noted, that using the same 
arguments as in the proof of Theorem 3, we can also prove the expectation of 
average sensitivity of order I , defined as 

S {l \f) = 2- K ]T |{xeFf|w H (x)=/and/(w)^/(wex)}|. 

In this case, instead of summing up all unit vectors in Equation (1), we sum 
up all vectors of Hamming weight I. As the equations (2) and (4) hold for all 
w e ¥2 we conclude that 

= (?) 2 -^P^ 

and by similar arguments 

E( S V\F K , p ))=( K i y p (l-p) 

respectively 



4 Extending Lynch's analysis 

As already mentioned James Lynch gave a very general analysis of randomly 
constructed Boolean networks (see [4]). Before stating his results we give a 
formal definition for Boolean networks A Boolean network B is a 4-tuplc 
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(V,E, F, x) where V = {1,...,N} is a set of natural numbers, E is a set of 
labeled edges on V, F — {/i, /at} is a ordered set of Boolean functions such 
that for each v G V the number of arguments of f v is the in- degree of v in 
S, these edges are labeled with 1, in- degree (v), and x = (xi, . . . x„) G F^. 
Suppose that a vertex i has Ki in-edges from vertices v^i, . . . , u,,^ . For y G 
we define 

B(y) = (/i(y Vi,i ) • • • ) Yvi.K--, 

The state of B at time is called the initial state x, so we define B° (x) = x. For 
time t > 1 the state is inductively defined as B'(x) = B(B' _1 (x)). Hence we 
can in interpret V as set of gates, E and F describes their functional dependence 
and x is the networks initial state. 

Assume some ordering /i, / 2 , ... on the set of all Boolean functions T, where 
each function fi depends on Ki arguments. Further a random variable F : Q — > 
T with probabilities ^ = P(F = fi) such that Y^HLiPi = 1 an d Y^iLiPi^-i < 
oo. Now a random Boolean network consisting of AT gafes is constructed as 
follows: For each gate a Boolean function is chosen independently, where the 
probability of choosing /, is given by Pi. Suppose a function / was chosen that 
has K arguments, these arguments are chosen at random from all (^) equally 
likely possibilities. At last an initial state is chosen at random from the set on 
all equally likely states. If the Boolean functions are chosen according to our 
previously defined random variable Fk we will call this networks iVif-Networks 
with connectivity K. If the functions are chosen according to F^ x ^ d or Fk, p 
we will call this networks biased random Boolean networks with connectivity K 
and fixed bias p respectively mean bias p. 

Let us now state Lynch's results. His analysis depends on a parameter 
K 3 A > depending only on the functions and their probabilities. We will 
define A later in Definition 3. First we have to state Lynch's definition of 
freezing and ineffective gates: 

Definition 2 (Lynch [4] Definition 1 Item 2 and 5). 
Let xeFf andvG V. 

1. Gate v freezes to y G F^ in t steps on input x i/B* (x) = y for all t' > t. 

2. Let u« £ F£. 

A gate v is t-ineffective at input x G Ff if B*(x) = B*(x © u (t,) ). 

Now we will state the main result. 
Theorem 4 (Lynch [4] Theorem 4 and 6). 

Let a, (3 be positive constants satisfying 2a log S + 2(3 < 1 and a log 5 < [3 where 
5 = E(Ki). 

1. There is a constant r such that for all x G F^ 

lim P(v is ineffective in a log N steps) = r 

n— *oo 

When A < 1, r = 1 and when A > 1 , r < 1. 
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2. There is a constant r such that for all x G 

lim P(v is freezing in a log N steps) = r 

When A < 1, r = 1 and when A > 1 , r < 1. 3 

The above theorem shows that if A < 1 almost all gates are freezing and 
ineffective and otherwise not. The next corollary gives us more information 
what happens if A > f : 

Corollary 1 (Lynch [4] Corollary 3 and Corollary 6). Let A > 1. For almost 
all random Boolean networks 

1. if gate v is not a log N -ineffective, there is a positive constant W such that 
for t < a log N, the number of gates affected by v at time t is asymptotic 
to WX f , 

2. if gate v is not freezing in a log N steps , there is a positive constant W 
such that for t < alogiV, the number of gates that affect v at time t is 
asymptotic to W\ l . 

Now we will state the definition of A for Boolean networks: 

Definition 3 (Lynch [4], Definition 4). Let f be a Boolean function of K ar- 
guments. For i G {1, . . . , K} 7 we say that argument i directly affects f on input 
w G Ff if /(w) 7^ /(w © uW). Now put 7(/, w) as the number of i's that 
directly affect f on input w. Given a constant a G [0, 1], we define 

oo 

A = E^ E 7(.A,w) a WH ( w )(l-a)^- WH < w >. 



=i 



w£F 



Obviously j(f,w) is identical to s/(w) which will be used instead in the 
further discussion. The constant a is the probability that a random gate is one 
(at infinite time) given that all gates at time have probability 0.5 of being one. 
(see [4, Definiton 2]). Assume that we choose the functions according a random 
variable F which should be either Fx, Ff?p d or Fk, p - The functions are chosen 
out the set Tki we denote a function's probability with pj. It follows that 

A= «"» W (1-<- Wh(w) Ep/ s /H ( 5 ) 

= ^ a WH(w) (l-a) K - WH(w) E(s F (w)) (6) 

= E(s F (w))f2(*)a i (l-a) K - i (7) 
»=o ^ ' 

= E{s F (w))=E F (s f ) (8) 



3 Please note that we here state a slightly weaker result than in the original analysis. 
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E(si?(w)) denotes the expectation of the sensitivity for a fixed w, Equation (7) 
follows from Lemma 1. Therefore, together with Theorem 1 and Theorem 3 we 
proved the following: 

Theorem 5 (Biased random Boolean networks). For random Boolean networks, 
if 

1. the functions are chosen according random variable Fk, p , it follows that 

\ = 2Kp(l-p), 

2. the functions are chosen according random variable F^ d , it follows that 

_ 2 K+1 Kp(l-p) 



As a special case of the above theorem we get (or by using Theorem 1) 

Theorem 6 ( TV if -Networks) . In random Boolean networks, where the functions 
are chosen according to the random variable Fx 



5 Discussion 



The results about iV-fT-Networks are consistent with experimental results. In 
fact if K < 2 almost all networks almost all gates are freezing and almost all 
gates are ineffective and otherwise not (see [2]). 

Obviously, the border between the ordered and disordered phase is given 
by A = 1. The resulting phase diagram for biased random Boolean networks, 
where the functions are chosen according to -F^- X ° d and Fk. p is shown in Figure 
1. It it interesting to note that if the functions are chosen with fixed bias, then 
also Boolean networks with connectivity K = 2 can become unstable. This 
conclusion can be drawn from Lynch's original result already. As mentioned in 
the introduction, he showed for K = 2, that A > 1 if the probability of choosing 
a non-constant non-canalizing function, namely the XOR or the inverted XOR 
function, is larger than the probability of choosing a constant function. For 
example if the bias is 0.5, the probability of choosing a constant function is 
zero, whereas both XOR and inverted XOR function have probability greater 
zero, hence A > 1. 

It is interesting to compare our results with previous results obtained first 
by Derrida and Pomeau using the so called annealed approximation (see [3]). 
In their annealed model the functions and connections are chosen at random 
at each time step. Considering two instances of the same annealed network 
starting in two randomly chosen initial states si(0), s 2 (0) they show that 

lim lim dH( Sl (*), S2 (*)) =c 
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Figure 1: Phase diagram for biased random networks: Functions chosen accord- 
ing F KiV (dashed) and F|™ d (solid) 

where c = 1 if 

2Kp(l - p) < 1 

and c < 1 otherwise. It is remarkable that the two models behave similar, but 
it is unclear whether this holds in general. 
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